Skip to content
This repository was archived by the owner on Jan 28, 2026. It is now read-only.

[NPU] Further tune Qwen2-7B accuracy#12456

Open
Oscilloscope98 wants to merge 8 commits intointel:mainfrom
Oscilloscope98:qwen2-tune-npu-acc
Open

[NPU] Further tune Qwen2-7B accuracy#12456
Oscilloscope98 wants to merge 8 commits intointel:mainfrom
Oscilloscope98:qwen2-tune-npu-acc

Conversation

@Oscilloscope98
Copy link
Contributor

@Oscilloscope98 Oscilloscope98 commented Nov 27, 2024

Description

https://github.com/analytics-zoo/nano/issues/1741#issuecomment-2503304990

  • Improve Qwen2-7B INT4 CW acc_lib accuracy with strategy 7&9 in above issue
  • Fit with MiniCPM-V-2_6
  • Fit with pipeline
  • Check on save & load
  • Update examples

@Oscilloscope98 Oscilloscope98 marked this pull request as draft November 27, 2024 11:07
@Oscilloscope98 Oscilloscope98 marked this pull request as ready for review November 28, 2024 03:19
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant